Combinatorial methods in comparative genomics
Abstract
Gene order analysis is a useful tool for understanding evolution and for discovering how different species are related. Problems in this area can usually be formulated in combinatorial language. We regard genomes as signed, circular permutations, so that evolutionary operations such as reversals (reversing the order of a segment of genes) and transpositions (moving a segment of genes) are easy to describe combinatorially. A commonly studied problem is to determine the evolutionary distance between two species. This distance is estimated by several combinatorial distances between gene order permutations, for instance the breakpoint distance and the reversal distance. In this thesis we have applied combinatorics to several important problems, leading to these results:
• An algorithm for computing, within any factor 1+ε > 1, the minimal number of reversals and transpositions, the latter counted twice, needed to transform one gene order permutation into another.
• A good approximate formula for the expected number of reversals between two gene order permutations on n genes with b breakpoints between them: $t_{\mathrm{appr}}(b) = \log\left(1 - \frac{b}{n\left(1 - \frac{1}{2n-2}\right)}\right)$ ...
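The operations named in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's own code, and it rests on several assumptions: a genome is stored as a list of nonzero signed integers read cyclically, the function names `reversal`, `transposition` and `breakpoints` are hypothetical, and a breakpoint is taken to be an adjacency of one genome that is absent from the other in either reading direction (conventions differ between authors).

```python
from typing import List, Set, Tuple

Genome = List[int]  # signed gene labels, read cyclically (assumed representation)


def reversal(perm: Genome, i: int, j: int) -> Genome:
    """Reverse the segment perm[i:j] and flip the sign of every gene in it."""
    return perm[:i] + [-g for g in reversed(perm[i:j])] + perm[j:]


def transposition(perm: Genome, i: int, j: int, k: int) -> Genome:
    """Move the segment perm[i:j] so that it follows perm[j:k] (requires i <= j <= k)."""
    return perm[:i] + perm[j:k] + perm[i:j] + perm[k:]


def adjacencies(perm: Genome) -> Set[Tuple[int, int]]:
    """All cyclic adjacencies of perm, in both reading directions."""
    n = len(perm)
    pairs: Set[Tuple[int, int]] = set()
    for idx in range(n):
        a, b = perm[idx], perm[(idx + 1) % n]
        pairs.add((a, b))
        pairs.add((-b, -a))  # the same adjacency read on the opposite strand
    return pairs


def breakpoints(perm: Genome, target: Genome) -> int:
    """Count the cyclic adjacencies of perm that do not occur in target."""
    known = adjacencies(target)
    n = len(perm)
    return sum((perm[idx], perm[(idx + 1) % n]) not in known for idx in range(n))


identity = [1, 2, 3, 4, 5]
after_reversal = reversal(identity, 1, 3)               # [1, -3, -2, 4, 5]
after_transposition = transposition(identity, 0, 2, 4)  # [3, 4, 1, 2, 5]
print(breakpoints(after_reversal, identity))            # 2
print(breakpoints(after_transposition, identity))       # 3
```

In this small example a single reversal creates two breakpoints and a single transposition creates three, which gives some intuition for why breakpoint counts carry information about the number of operations that separate two gene orders.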
Similar resources
Statistical and Combinatorial Aspects of Comparative Genomics*
This document presents a survey of the statistical and combinatorial aspects of four areas of comparative genomics: gene order based measures of evolutionary distances between species, construction of phylogenetic trees, detection of horizontal transfer of genes, and detection of ancient whole genome duplications.
Comparative genomics of human stem cell factor (SCF)
Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...
An Introduction to the China Rice Functional Genomics Program
The China Rice Functional Genomics Program (CRFGP) was initiated in 1999 by the Ministry of Science and Technology of China under the National Basic Sciences Initiative and was expected to last for an initial period of five years. The CRFGP involves 20 research groups from the Chinese Academy of Sciences and some major universities and focuses on the identification of genes controlling flowerin...
Large-scale cis-element detection by analysis of correlated expression and sequence conservation between Arabidopsis and Brassica oleracea.
The rapidly increasing amount of plant genomic sequences allows for the detection of cis-elements through comparative methods. In addition, large-scale gene expression data for Arabidopsis (Arabidopsis thaliana) have recently become available. Coexpression and evolutionarily conserved sequences are criteria widely used to identify shared cis-regulatory elements. In our study, we employ an integ...
Comparative genomics for reliable protein-function prediction from genomic data.
Genomic data provide invaluable, yet unreliable information about protein function. However, if the overlap in information among various genomic datasets is taken into account, one observes an increase in the reliability of the protein-function predictions that can be made. Recently published approaches achieved this either by comparing the same type of data from multiple species (horizontal co...
Publication date: 2003